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Abstract 

Small world models are networks consisting of many local links and 
fewer long range 'shortcuts', used to model networks with a high degree of 
local clustering but relatively small diameter. Here, we concern ourselves 
with the distribution of typical inter-point network distances. We estab- 
lish approximations to the distribution of the graph distance in a discrete 
ring network with extra random links, and compare the results to those 
for simpler models, in which the extra links have zero length and the ring 
is continuous. 
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1 Introduction 



There are many variants of the mathematical model introduced by Watts and 
Strogatz ,15j to describe the "small-world" networks popular in the social sci- 
ences; one of them, the great circle model of Ball et. al. j4j , actually precedes ^Hl ■ 
See pp for a recent overview, as well as the books jS] and A typical descrip- 
tion is as follows. Starting from a ring lattice with L vertices, each vertex is 
connected to all of its neighbours within distance k by an undirected edge. Then 
a number of shortcuts are added between randomly chosen pairs of sites. In- 
terest centres on the statistics of the shortest distance between two (randomly 
chosen) vertices, when shortcuts are taken to have length zero. 

Newman, Moore and Watts ^^Ij |^ proposed an idealized version, in which 
the lattice is replaced by a circle and distance along the circle is the usual 
arc length, shortcuts now being added between random pairs of uniformly dis- 
tributed points. Within their [NMW] model, they made a heuristic computation 
of the mean distance between a randomly chosen pair of points. Then Barbour 
and Reinert [7] proved an asymptotic approximation for the distribution of this 
distance as the mean number Lp of shortcuts tends to infinity; the parameter p 
describes the average intensity of end points of shortcuts around the circle. In 
this paper, we move from the continuous model back to a genuinely discrete 
model, in which the ring lattice consists of exactly L vertices, each with con- 
nections to the k nearest neighbours on either side, but in which the random 
shortcuts, being edges of the graph, are taken to have length 1; thus distance be- 
comes the usual graph distance between vertices. However, this model is rather 
complicated to analyze, so we first present a simpler version, in which time 
runs in discrete steps, but the process still lives on the continuous circle, and 
which serves to illustrate the main qualitative differences between discrete and 
continuous models. This intermediate model would be reasonable for describing 
the spread of a simple epidemic, when the incubation time of the disease is a 
fixed value, and the infectious period is very short in comparison. In each of 
these more complicated models, we also show that the approximation derived 
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for the [NMW] model gives a reasonable approximation to the distribution of 
inter-point distances, provided that p (or its equivalent) is small; here, the er- 
ror in Kolmogorov distance is of order 0{p^ log(i)), although the distribution 
functions are only 0{p) apart in the bulk of the distribution. 

2 The continuous circle model for discrete time 

In this section, we consider the continuous model of P', which consists of a 
circle C of circumference L, to which are added a Poisson Po {Lp/2) number 
of uniform and independent random chords, but now with a new measure of 
distance between points P and Q. This distance is the minimum of ^(7) over 
paths 7 along the graph between P and Q, where, if 7 consists of s arcs of 
lengths li,...,ls connected by shortcuts, then ^(7) J2t=l\^r^, where, as 
usual, [/] denotes the smallest integer m > I; shortcuts make no contribution 
to the distance. We are interested in asymptotics as Lp — > 00, and so assume 
throughout that Lp > 1. 

We begin with a dynamic realization of the network, which describes, for 
each n > 0, the set of points R{n) C C that can be reached from a given 
point P within time n, where time corresponds to the d(-) distance along paths. 
Pick Poisson Po (Lp) uniformly and independently distributed 'potential' chords 
of the circle C; such a chord is an unordered pair of independent and uniformly 
distributed random points of C. Label one point of each pair with 1 and the 
other with 2, making the choices equiprobably, independently of everything 
else. We call the set of label 1 points Q, and, for each q G Q, we let q' — q'{q) 
denote the label 2 end point. Our construction realizes a random subset of these 
potential chords as shortcuts. We start by taking i?(0) = {P} and B{0) — 1, and 
let time increase in integer steps. R{n) then consists of a union of B{n) intervals 
of C, each of which is increased by unit length at each end point at time n -I- 1, 
but with the rule that overlapping intervals are merged into a single interval; 
this defines a new union of B'{n + 1) intervals R'{n + 1); note that B'{n + 1) 
may be less than B{n). 
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Now define dR{n + 1) := R'{n + 1) \ R{n). Whenever dR{n + 1) n Q is 
not empty — that is, whenever dR{n + 1) includes label 1 points — then, for 
each q e dR{n + 1) n Q, we accept the chord {q,q'} if q' = q'{q) ^ R'{n, + 1) 
(that is, if the chord would reach beyond the cluster R'{n + 1)), wc reject it if 
q' £ R{n), and we accept the chord {q, q'} with probability 1/2 if q' G dR{n + l), 
independently of all else. Letting Q{n+1) := {q' : {q, q'} newly accepted}, take 
R{n+l) = R'{n + l)UQ{n + l) and set B{n+1) = B'{n + l) + \Q{n+l)\. Note 
that B{n+1) may be either larger or smaller than B{n), and that S[l/2] — 1 sl.s. 

After at most \L/2] time steps, each of the potential chords has been either 
accepted or rejected independently with probability 1/2, because of auxiliary 
randomization for those chords such that {q, q'} G dR{n) for some n, and be- 
cause of the random labelling of the end points of the chords for the remainder. 
Hence this construction does indeed lead to Po [Lp/2) independent miiform 
chords of C. 

For our analysis, as in [7], we define a second process 5'(n), starting from the 
same P and the same set of potential chords, and with the same unit growth per 
time step. The differences are that every potential chord is included, so that 
no thinning takes place, and, additionally, whenever two intervals intersect, 
they continue to grow, overlapping one another, and each continues to generate 
further chords according to a Poisson process of rate p. This pure growth process 
S{n) agrees with the original construction during the initial development with 
high probability, until S has grown enough that overlap becomes likely; its 
advantage is that it has a branching structure, and is thus much more easily 
analysed. We denote its length at time n by s{n) > r{n), overlaps now being 
counted according to multiplicity, and the number of intervals by M{n) > B(n). 
Then M{n) is just a pure birth chain with offspring distribution 1 + Po (2/?), so 
that EM{n) = (1 + 2/3)", and the total length of the M{n) intervals is given by 

ri-l 

so that 
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Furthermore, 

W{n) := (l + 2p)-"M(n) 

forms a square integrable martingale, so that {l+2p)^"AI{n) Wp a.s. for some 
Wp such that Wp > a.s. and EWp = 1. Hence also (1 + 2p)-"s(n) p^^Wp 
a.s. and p^^ a.s.. Note also that VarM^(n) < 1. 

Our strategy is to pick a starting point P, and run both constructions up to 
an integer time t^, chosen in such a way that R(n) and S{n) are (almost) the 
same for n < Tr- Pick 

log (Lp) 
"° L21og(l + 2p)_ 

where [x\ denotes the largest integer no greater than x, and let 

00 :=</>o(i,p)-(ip)-'/'(l + 2pr, 

so that (1 + 2p)"» = (j)a\/Tp and (1 + 2p)"i < 0o < 1; note that (?!)o w 1 if p is 
small. Now let = no + r, and assume that 

'-'^ 61og(l + 2p) ^°g(^^)' (^-l) 

implying in particular that < 3 iog^{^2p) • Then, writing — R{Tr),Sr = 
S(t^),Mj. = M{tj.), and s,. ~ s(Tr), we have 



F,Mr^(j)oVLpil + 2pY 

and 



Es, = p-i(0oyi;^(l + 2p)'--l). 

Next, independently and uniformly, we pick a second point P' G C, and 
a second set of potential chords, Q', and run both constructions for time t^', 
where r' also satisfies (|2.1|l . yielding R'^,, , S'^, , M^, —: N^' and s'., =: u^'. Then, 
at least for small p, there are about (f>QLp{l + 2pY^^ pairs of intervals, with 
one in Sr and the other in S*^, , and each is of typical length p^^, so that the 
expected number of intersecting pairs of intervals is about 

-^02^^(1 + 2pY+'''' = 20^(1 + 2pY+^\ 
Lp 



which, in the chosen range of r, r', grows from ahiiost nothing to the typically 
large value 2(/)§(Lp)^/'^. For later use, label the intervals in Sr as /i, . . . jIm^, 
and the intervals in S'^, as Ji , . . . , Jjv , ; then we can write the number Vr,r' of 
intersecting pairs of intervals as 

Vry (2.2) 

where 

= l{h n Jj ^ 0}. (2.3) 

Now the probability that = is the same as when the construction for 
iS" uses the original set Q of potential chords, because of the independence of 
Poisson processes on disjoint subsets; the event Vr,r' — indicates that the two 
processes have no intersecting pairs of intervals when stopped at the times r^, 
Tr' , and thus use disjoint sets of chords. Furthermore, we can show that the event 
Vr.r' = is with high probability the same as the event Vr,r' — 0, where Vry is 
the number of intersections of R{r) and R'ir'). Finally, if R{r) and R'ir') have 
no intersections, then the "small worlds" distance between P and P' is more 
than 

Tr + Tr' = 2^0 + T + r' . 

Hence we have solved the problem if we can find a good approximation to the 
probability that Vry — 0; this we do by showing that Vr.r' approximately has 
a mixed Poisson distribution, and by identifying the mixture distribution. We 
usually take r = r' or r = r' + 1, the latter to allow for the possibility of the 
number of steps in the shortest path being odd. 

After this preparation, we are in a position to summarize our main results. 
These are treated in more detail in the next section, in Theorem 13. 91 Corollary 
13.101 and Theorem 13. 151 We let D denote the small worlds distance between a 
randomly chosen pair of points P and Q on C, so that, as above, 

V[D > 2no + r + r'] = P[K,r' = 0]. 
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The following theorem approximates the distribution of D by that of another 
random variable _D*, whose distribution is more accessible; in this theorem, p 
and the derived quantities (j)o,nQ, Nq and xq all implicitly depend on L, as does 
the distribution of D* . 

Theorem 2.1 Let A denote a random variable on the integers with distribution 
given by 

P[A>x]=-E{e~^^oa+2prw,w^^^ 

and set D* = A + 2no. If Lp oo and p = p{L) = 0{L'^), with (3 < 4/31, then 
dTv{C{D)X{D*))~^Q as L ^ oo. 

1. Ifp is large, let Nq be such that {l + 2p)^o < Lp < (l + 2p)^"+\. and define 
a e [0, 1) to be such that Lp^ {1 + 2p)^°+°' ; then, with xq^ Nq- 2nQ + 1, 

P[A>xo] > l-2(l + 2p)-"; 
P[A>a;o + l] = 0((l + 2p)-i+"log(l + p)), 

so that A concentrates almost all its mass on xq, unless a is very close 
to 1. 

2. If p 0, the distribution of pA approaches that of the random variable T 
defined in J^, Corollary 3.10: 

P[pA >x]^P[T>x]= / dy. 

Jo 1 + 2e^^y 

The errors in these distributional approximations are also quantified, for given 
choices of L and p{L). 

This result shows that, for p small and x — lp with Z G Z, 

PlpA>x] - ^{^~2c^l(l+2pr^-W,W^^ 

« E{e-2-'"'^'^'} = P[T>a;], (2.4) 

where W and W are independent NE(1) random variables. Indeed, it follows 
from Lemma 13.131 below that Wp as p ^ 0. One way of realizing a 
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random variable T with the above distribution is to reahze W and W, and 
then to sample T from the conditional distribution 

P[T>x\W,W'] - e-^e-^'ww' 

— g-cxp{22;+log2+logVK+log W'} 

= g- cxp{2a;+log2-Gi-G2} (2 5) 

where Gi := — logM^ and G2 ■= — logM^' both have the Gumbel distribution. 
With this construction, 

P[2T - {Gi + G2 - log 2} > x I W, W] = e-"-' , 

whatever the values of W and W\ and hence of G\ and G2, implying that 

2T =p Gi+G2-G3-log2, 

where G\ , Gi and G3 are independent random variables with the Gumbel dis- 
tribution. The cumulants of T can thus immediately be deduced from those of 
the Gumbel distribution, given in Gumbel [S]: 

ET = 1(7 -log2)« -0.058; 

v.. . '1. 

Note that, in view of CoroUarv 13 . 21 below, the conditional construction (|2.5|l 
can be interpreted in terms of the processes S and S", since Wp and W'p are 
essentially determined by the early stages of the respective pure birth processes, 
and the extra randomness, conditional on the values of Wp and Wp, comes from 
the random arrangement of the intervals on the circle C. 

In the NMW heuristic, the random variable Tmmw is logistic, having dis- 
tribution function e^^(l + e^^)^^; note that this is just the distribution of 
i(Gi — G3). Hence the heuristic effectively neglects some of the initial branching 
variation. 
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3 The continuous circle model: proofs 



The first step in tlie argument outlined above is to establish a Poisson approxi- 
mation theorem for the number of pairs of overlapping intervals, one in Sr and 
the other in S'^., . The following result has been shown in [7] . 

Proposition 3.1 Let M intervals Ii, . . . , Im with lengths ii, . . . , tM and N in- 
tervals Ji, . . . , Jn with lengths ui, . . . , un be positioned uniformly and indepen- 
dently on C. Set V :— X^fii '^f=i -^ij' where Xij I[Ii n Jj ^ 0]. Then 

where \M,N,t,u) L^^{Nt + Mu), t := J2fLi^t' Y.j'=i'^j "-'^^ Vtu ■= 

maxjmaxi ti, maxj Uj}. 

The proposition translates immediately into a useful statement about V^,r', 
when P' is chosen uniformly at random, independently of all else. 

Corollary 3.2 For the processes S and S' of the previous section, we have 

\P[Vry = I Mr = M, Nr' = iV, = t, u,.' ^ u] ~ exp{-L"^(iVt + Mu)}\ 

< 8L-'^{MTr+ NTr'). 

Remark. If P' is not chosen at random, but is a fixed point of C, the result of 
CoroUarv 13 . 21 remains essentially unchanged, provided that P and P' are more 
than an arc distance of + r^' apart. The only difference is that then Xn — 
a.s., and that Nt + Afit is replaced by Nt + Mu — 2Tr — 2Tr'. If P and P' are 
less than + r^' apart, then PlVr.r' = 0] = 0. 

The next step is to show that P[l/^^r' = 0] is close to P[T4,r' = 0]. We 
do this by directly comparing the random variables Vr,r' and Vr.r' in the joint 
construction. As for Corollary 3.5 in |7], the following assertion can easily be 
shown to hold. 

Proposition 3.3 With notation as above, we have 

P[Vry ^ Vry] < 32TrTr'L~^'E{^Mr.Nr'iMr + N^-' - 2)}. 
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To apply Corollary 13.21 and Proposition 13.31 it remains to establish more 
detailed information about the distributions of Mr and Sr- In particular, we 
need to bound the first and second moments of Mr, and to approximate the 
quantity E{exp{—L^^{Nr' Sr + MrUr')}). We begin with the following lemma. 

Lemma 3.4 The random variable M{n) has as probability generating function 

Gm(„)(s) := Es^'^(") = f{s) - 

where denotes the nth iteration of f . In particular, we have 

BMr = <t)(i^/L'p{l + 2pY 
^^Mr{Mr-l) = (/9+l)(^oV^(l + 2p)'-l{0o/^(l + 2p)'--l} 

< 4>lLp[l + 2pf\ 

Proof: Since M{n) is a branching process with 1 + Vo{2p) offspring distri- 
bution, the probability generating function is immediate, as are the moment 
calculations 

EAf (n) = (1 + 2pY- 
EM(n)(M(n) - 1) = 2(p + 1)(1 + 2p)"-i {(1 + 2p)" - 1} . 

The moments of Mr follow from the definition of r^. [] 

These estimates can be directly applied in Corollarv l3 .21 and ProDOsition l3.3l 
Define 

m{r,r') M{p{n^ + {r V r'))Y 4>l{l + 2pY+^'+^^''-'^ (3.1) 

772 (r,r') := 16{p(tio + (r V r'))}0o(l + 2p)(''^'-'). (3.2) 

Corollary 3.5 We have 

V[%r' ^Vr,r'] <Vi{r,r'){Lp)-^'^ 

and 

|P[V^„, =0]-Eexp{-i-i(A^,,s, + AW)}| < {vi{ry) + ri2{ry)}{Lp)-^'^ . 
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Consideration of the quantity E(exp{~L ^{Nr'Sr + MrUr')}) now gives the 
immediate asymptotics of 

P[Vr,^, = 0] = P[D >2no + r + r'], 

where D denotes the "small world" distance between P and P' . 

Corollary 3.6 If p ~ p{L) is bounded above and Lp — > oo, then as L ^ oo, 

\P[D >2no + r + r'] - Eexp{-202(1 + 2pY+''' WpW'p}\ -> 

uniformly in |r|' < giog(i_|.2p) ^"S (^P); where Wp and Wp are independent 
copies of the limiting random variable associated with the pure birth chain M . 

Proof: The conditions ensure that and r^' both tend to infinity as L — > cx), at 
least as fast at clog(Lp), for some c > 0. Then, since W{n) = {l+2p)^^M{n) 
Wp a.s. and s{n)/M{n) p-^ a.s., and since (1 + 2p)^-+^'-' = (j)l{l + 2pY+''' Lp, 
it is clear that 

exp{~L^'^{Nr'Sr + MrUr')} ^ exp{ -2(Lp) " ^M^iY^/ } 

= exp {-2(Lp)-i(l + 2py"+^r'W{Tr)W'iTr>)} 
^ exp{-202(i + 2p)'-+'-VpW^;}, 

uniformly for r, r' in the given ranges. [] 

Hence P[D > 2no + r + r'] can be approximated in terms of the distribution 
of the limiting random variable Wp associated with the pure birth chain A/. 
However, in contrast to the model with time running continuously, this distri- 
bution is not always NE (1), but genuinely depends on p. Its properties are not 
so easy to derive, though moments can be calculated, and, in particular, 

EVFp = 1; Var Wp = 1/(1 + 2p); (3.3) 

it is also shown in Lemma f3. 131 that C{Wp) is close to NE(1) for p small. We 
also need the following lemma, which is useful in bounding the behaviour of the 
upper tail of C{D). 
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Lemma 3.7 For all 0,p > 0, 

E(e-»>^p>^;) < 0-Hog{l + e). 
Proof: The offspring generating function of the birth process M satisfies 

f{s) = se^"^^-'^ < s{l + 2p{l - s)}-' =: 
for aU < s < 1. Hence, with rn = 1 + 2p, 

E(e-'/'M'p) lim /(")(e-'^™"") < hm /("^(e"''™"") = (1 + V)"^ (3.4) 

n — >OG n — >oo 

The last equahty foUows from (8.11), p. 17 in jJU], noting that the right-hand 
side is the Laplace transform of the NE(1) - distribution. Furthermore, we have 



Jo 



e e 



and so, applying (|3.4|l twice, and because the function {l + t) ^ is decreasing in 
i > 0, we obtain 

E(e-^^^^p) < E{(1 + 0Wp)-i} 

= 0-' / Ee-'^-e-'/^dt 







< 0-^ / {l+t)-^e-'^Ut 







< 0-' I il + t)-'dt = 0-'logil + 0) 

as required. [] 

The simple asymptotics of Corollary 13.61 can be sharpened. At first sight 
surprisingly, it turns out that it is not necessary for the times and Tr> to tend 
to infinity, since, for values of p so large that uq is bounded, the quantities W{n) 
are (almost) constant for all n. Write 

n-l 

(l + 2p)-"s(n) = 2^M^(j)(l + 2p)-("-^") 

3=0 

= -W{n) + ^^U{n), (3.5) 
P P 
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where 

1 4- 

—^Uin) = 2Y(Wij)-W(n)){l + 2p)-^''-^^ -W(n)p-\l + 2py. 
Computation gives Ef/(n) = -(1 + + 2/9)~", and 

mwin) - wmwin) - wm = ^^^^ (l - ^^^1-^ 

if j > ^, so that 

2(l + 2p)-" + 2(l + 2p)-2" 



Yav{U{n)} < 2- 



(1+P) 
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, (1 + 2p)-" 
- ' (1 + p)^ ' 

and thus 

(l+p)2E{;7(n)2}<9(l + 2p)-". (3.7) 

Then we have 

L~'^{Nr'Sr + MrUr') 

= cg{l + 2py+'^'{W{Tr){W'{Tr') + (1 + p)Ul,) + W'{Tr'){W{Tr) + (1 + p)Ur)}, 

where W{Tr) := W{Tr) and Ur ■= Uijr), so that, by Taylor's expansion, and 
because 'EiW{n) = 1 for all n, 

-Ee^^{-L-^{Nr'Sr + MrUr')] - Eexp{-20g(l + 2pY+'''W{Tr)W'{Tr')] 

< 4>l{l + 2pr+^\l + p){nUr\ + E|C/;,|} (3.8) 

and 

Eexp{-2<^2(i ^ 2p)'-+'-V(T,)W(r,0} - Eexp{-2<^2(1 + 2pY+'''WpWl} 

< 2(I>1{1 + 2pY+'-'{E\Wp - W{Tr)\ + E\W'p - W'{Tr')\}. (3.9) 

Using these results, we obtain the following theorem. 
Theorem 3.8 If P' is randomly chosen on C, then 

P[D>2n„+r + r'] - E{e-2-^o(i+2p)'^+'^V,w;| 

< {vi{r,r') +V2{rymLp)-'/^ +V3{ry){Lp)-'/' 
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where 771,772 are given in ^tl.l\l and i^cl.S^} . 

and where, as before, D denotes the shortest distance between P and P' on the 
shortcut graph. 

Proof: Since {Vr,r' = 0} = {D > 2no + r + r'}, we use Corollary EiSl and (|3.8|l 
and (|3.9|l to give 

P[D > 2no + r + r']- E{e-2^o(i+2p)'-+'-VpW'/| 
< {771 (r, r') + rj2{r, r')}{Lp)-'^' + 0^(1 + 2py+^' {1 + p){E\Ur\ + E|C/;,|} 
+ 202(1 + 2pY+^'{-E\Wp - W{Tr)\ + E|l¥; - W'{Tr,)\}. (3.10) 

Now, from H3.7() and the Cauchy-Schwarz inequality, 

(1 + p)mr\ < 3(Lp)-i/Vo '^'(1 + 2p)-'-/^ (3.11) 

Then, since W{n) is a martingale, and 

oo 

Wp-W{n) = ^(W^(^ + 1) - VF(^)) 

oo 

= ^(1 + 2p)-'-\M{£ + 1) - (1 + 2p)M{£)), 
e=n 

we have 

oo 

E{Wp-W{n))^ = ^(l + 2p)^2(£+i)E(j^^(^+l)_(l + 2p)M(f))2. 

i=n 

Now 

M(£) 

M{£+1) = ^(Z,+i(z) + l), 

i=l 

where {Zi{i))f^i are i.i.d. Po (2p)-variates, and so 

E{M{i+l)- {l + 2p)M{i)f = EVar(A/(£ + l)| A/(^)) 

= 2pE,M{£) ^ 2p{l + 2pY, 
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implying that 

oo 

E{Wp~WiTr)f < 2p(l + 2p)-2 ^(l + 2p)- 

= (l + 2p)-2(l + 2p)--'-. 

Hence 

E\Wp-W{rr)\ < (1 + 2p)-i-5(»o+'-) 
and the theorem follows. fl 



Theorem 13 . 81 can be translated into a uniform distributional approximation, 
as follows. 

Theorem 3.9 If A denotes a random variable on the integers with distribution 
given by 

P[A>a;] =E{e-2'^o(i+2prH^pM/p'}^ ^ g Z, (3.12) 
and D* = A + 2no, then 
dTv{C{D),C{D*)) 

- O (log(Lp)(l + 2p)V4(Lp)-* + (1 + 2pY'\Lp)-^^ . 

In particular, for p = p{L) = OiLl^) with /3 < 4/31, 

dTv{C{D),CiD*)) ^0 as L ^ oo. 

Proof: It is easy to see that A, defined as above, is indeed a random variable. 
Its upper tail is bounded by Lemma 13.71 which implies that 



1 

20g' 



P[A >x]< — 2(l + 2p)-^(2 + xlog(l + 2p)) (3.13) 



for any x > 0, since 0o < 1 and log(l + 2y) < 2 + logy in y > 1. Then, for 
any x £ Z, writing r{x) — [x/2\ and r'{x) — x — r{x) < (a; + l)/2, it follows 
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from Theorem 13 . 81 that 



\P[D > 2no + x]- P[D* > 2no + x]\ 

<{miri^),r'ix))+V2irix),r\x))}{Lp)-'/^+f^3{r{x),r'{x)){Lpr'/^ 



2 



p\og{Lp)_ 

p\og{Lp) 



.log(l + 2p) 

^log(Lp)-21og 00 



{l + 2p)^Lp)-^ + {l + 2p)HLp)-^ 



so long as 2: < L '' \o^{i+2p)^ -I • This is combined with (|3.13|) evaluated at 
X — \ '' '°^iig(j4-2 j"^ 1 ' which gives rise to a term of order O (j^Lp)^^ log(Lp) 
and the main estimate follows. 

The above bomid tends to zero as L ^ oo as long as p = p{L) = 0[L^) for 
(3 < 4/31. Thus the theorem is proved. [] 



For larger p and for L large, it is easy to check that uq can be no larger than 4, 
so that interpoint distances are extremely short, few steps in each branching 
process are needed, and the closeness of C{D) and C{D*) could be justified 
by direct arguments. Even in the range covered by Theorem 13.91 it is clear 
that C{D) becomes concentrated on very few values, once p is large, since the 
factor 20q(1 + 2pY in the exponent in (|3.12l) is multiplied by the large factor 
(1 + 2p) if X is increased by 1. The following corollary makes this more precise. 

Corollary 3.10 //A^o *s such that 

(l + 2p)^" <Lp<{\ + 2pf»+\ 

and if Lp = (1 + 2^)^"+", for some a £ [0, 1), then, taking Xq — Nq — 2nQ + 1, 
we have 

P[A>2;o] > l-2(l + 2p)^", 

and 

P[A > :eo + 1] - Eexp{-2(1 + 2p)^-''WpW'p} < ^(1 + 2p)-i+" log(3 + 4p). 
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Proof: The result follows immediately from Jensen's inequality; 

Eexp{-2(l + 2p)-"W^pW;} > cxp {-2(1 + 2p)-"ET4^^EM^;} 

> l-2(l + 2p)-" 

as EWp = 1, and from Lemma UTTI with 6* = 2(1 + 2p)i-". [] 

Thus the distribution is essentially concentrated on the single value xq if p is 
large and a is bounded away from and 1. If, for instance, a is close to 1, then 
both Xq and xq + 1 may carry appreciable probability. 

li p po as L oo, then the distribution of A becomes spread out over Z, 
converging to a non-trivial limit as L — s- oo along any subsequence such that 
</>o(i,p) converges. Both this behaviour and that for larger p are quite differ- 
ent from the behaviour found in the continuous model of However, if p 
becomes smaller, the differences become less; we now show that, as p — > 0, the 
distribution of pA approaches the limiting distribution of T obtained in |[7| . 

The argument is based on showing that the distribution of Wp is close 
to NE (1). To do so, we employ the characterizing Poincare equation for Galton- 
Watson branching processes (see Harris TU', Theorem 8.2, p. 15); if 

MO) = Ee-'^^" 

is the Laplace transform of £{Wp), then 

0p((l + 2p)^?) = /(0p(0)). (3.14) 

We show that when p w then (j>p{6) is close to (t)e{0) = (1 + 6)~^, the Laplace 
transform of the NE(1) distribution. 
Let 

5 = (g : [0,oo) R : |l.g|lg := sup0-2|.g(0)| < ool , 

and let 

W = {X : [0, c3o) -> R : x{0) = 1-0 + 9{0) for some g £ G} . 
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Then Ti contains all Laplace transforms of probability distributions with mean 1 
and finite variance. On H, define the operator ^ by 



m 



where 



f{s) = se^"^^-') 



is the probability generating function of 1 + Po (2/9), and m = 1 + 2p > 1. Thus 
the Laplace transform (f)p of interest to us is a fixed point of ^. 

Lemma 3.11 The operator ^ is a contraction, and, for all x, '^i' € W, 



Proof: For all x, V' S W and ^ > 0, we have 



f(x 



f 



< 



X I — I - V' , 
m / \m 



X I — I - , 



< m-^llx 



X - -V' - 

m J \m, 



as required. 



Lemma 3.12 For the Laplace transform <pe, we have 

2p2 



(l + 2p)2- 



Proof: For all 6* > 0, we have 



m + 6 



< 



1 1 
l + 9¥ 
1 

2(1 + 6)6'^ \m + ej ' 



- 1 



2^61 
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using the inequality |(l + x)e ^ — 1| < ^ for a; > 0. The lemma now follows 
because m + 9 > m = 1 + 2p and 1 + 6* > 1. [] 

Lemmas 13. 1 II and 13 . 1 21 together yield the following result. 
Lemma 3.13 For any p > 0, 

Proof: With Lemmas 15. Ill and ITT^ it follows that 

\\(kp-<Pe\\g = ll*0p-0e||e 

< II *0p-*</.e lie + 11 lie 
1 

< — II0P - (/-elle + 77— 

m (1 + 2p)-' 

Note that indeed (j)p ~ (j>e ^ Q- Thus, since m > 1, it follows that 

lU A. II <r P 
||0p-0e||e < ^_i(i + 2p)2^TT^' 

as required. [] 

As an immediate consequence, C{Wp) — > NE(1) as p 0. Theorem 13. 141 re- 
formulates this convergence as a pointwise comparison theorem directly relevant 
to the distribution functions of A and T. 

Theorem 3.14 Let W, W be independent NE (1) random variables. Then, for 
all 9 > 0, we have 

4P a2 



< 



1 + P 



Proof: We have 
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Since 



Eq^,{9W') = Ee 



-9WW' 



we obtain from the triangle inequality, (|3.3|) and Lemmas 13.111 13.121 and 13.131 
that 

2p2 



< 



< 



< 



1 



(l + 2p)2 



1 



1 



P 



2p2 



26^1 + p) 

l + 2p \Vl + 2p^ V l + 2p^ (l + 2p)2 

4P n2 



l + 2p ' 



as required. 



Noting that 



-dWW 



1 + % 



dy, 



we obtain the following theorem. 



Theorem 3.15 As in Theorem \cl.!A let A be a random variable on Z with 
distribution given by 



P [A > a;] = E{ 



,-24,l{i+2prw,w,' 



}■ 



Let T denote a random variable on R with distribution given by 



P[T> z] = 



1 + 2ye^ 



■ dy. 



Then 



sup |P[pA > z]-P[T> z]\ = (pi/='(l + log(l/p)) 



Proof: We use an argument similar to that used for Theorem 13. 91 For a large, 
we can use the bound 



1 + ay Jo 1 + ay 



(3.15) 
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from which, for z > and with c(p) defined by 

l>c(p) (2p)-ilog(l + 2p) > 

we have 

P[r > zc{p)] < e-'''<p\l + zc{p)) < (1 + zc(p))e-2-(i-p). (3.I6) 

Similarly, from Lemma 13.71 we have 

P[pA>z] < (l + 2p)-("/'')+2(l + zc(p)) < (l + 2p)2(l + zc(p))e-2^(i-''). 

Complementing these upper tail bounds, from Thcorcm l3. 14l and for z G pZ, 
we have 



P[/9A > z] - 



dy 



< 



16p 
l + 2p 



(1 + 2^)2^/" < 



16p 



l + 2p 

(3.17) 

Using the facts that (1 + 2py/P = e^^^t'') and that (1 + 2p)-^ < 0o < 1, and 
because, for a, 6 > 0, 



1 + ay 



dy 



1 + % 



< 



\b-a\ 



it also follows that 



/o l + 2#2(i + 2p)-/P 
<2|02_i|(i + 2pr/P< 

and then, from H3.18|) and (|3.15|l . we have 



max{l, a, 6} 
dy - P[To > zc(p)] 



(3.18) 



l + 2p 



(3.19) 



|P[T > zc(p)] - P[r > z]| < min{4z(l - c(p)), (2 + z)e-'} = 0(plog(l/p)). 

(3.20) 

Combining the bounds l|TT7|l . l|TT^ and (j?:^ for e^^ < p-^/^ gives a 
supremum of order p^^^ for |P[pA > z] — P[r > note that z may actually 
be allowed to take any real value in this range, since T has bounded density. 
For any larger values of z, the upper tail bounds give a maximum discrepancy 
of order 0{p^/^(l + log(l/p))}, as required. Note that, in the main part of the 
distribution, for z of order 1, the discrepancy is actually of order p. [] 
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Numerically, instead of calculating the limiting distribution of Wp, we would 
use the approximation 



from 13.8|l and H3.11|) . where the distributions of W{Tr) and W'{Tr') can be 
calculated iteratively, using the generating function from Lemma |3. 41 As D is 
centred around 2no ~ 2[yJ: and as r is of order at most i^^fj^^, only order 



4 The discrete circle model: description 

Now suppose, as in the discrete circle model of Newman et al. JSIj that the 
circle C becomes a ring lattice with A = Lk vertices, where each vertex is 
connected to all its neighbours within distance k by an undirected edge. In 
the notation of , a number of shortcuts are added between randomly chosen 
pairs of sites, with probability per connection in the lattice, of which there 
are Afc; thus, on average, there are Akcf) shortcuts in the graph. In contrast to 
the previous setting, it is natural in the discrete model to use graph distance, 
which implies that all edges, including shortcuts, have length 1. This turns out 
to make a significant difference to the results when shortcuts are very plentiful. 

For ease of comparison with the previous model, which collapsed the k- 
neighbourhoods, we adopt a different notation. The model can be formulated 
as the union of a Bernoulli random graph Ga,.^ and the underlying ring lattice 
on A vertices. Here we write cr = ^, so that the expected number of edges in 
Ga.^ is close to the value Lp/2 in the previous model; comparing the expected 
number of shortcuts with that given in T3) . we also have 




iterations would be needed. 



log(l+2p) 



Afcc/. = iA(A - 2fc - 1)- ^(A - 2A: - 1) « ^Lp, 
relating our parameter a to those of |13| . In particular, we have 




(4.1) 
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The model can also be realized by a dynamic construction. Choosing a 
point Po € {l,---,-^} at random, set -R(O) = {Pq}- Then, at the first step 
(distance 1), the 'island' consisting of Pq is increased by k points at each end, 
and, in addition, m[^^ Bi (A — 2A; — 1, ^) shortcuts connect to centres of new 
islands. At each subsequent step, starting from the set R{n) of vertices within 
distance n of Po, each island is increased by the addition of k points at either 
end, but with overlapping islands merged, to form a set ii'(n + 1); this is then 
increased to R{n + 1) by choosing a Bernoulli—^ thinning of the edges joining 
R{n) \ R{n - 1) to C \ R'{n + 1) as shortcuts. 

The branching analogue of this process, which agrees with the current pro- 
cess until its first self-overlap occurs, has individuals, here representing the 
islands, of two types: newly formed type 1 islands, consisting of just one vertex, 
and existing type 2 islands. A type 1 island at time n becomes a type 2 island at 
time n+l, and, in addition, has a Bi (A, ^)-distributed number of type 1 islands 
as 'offspring'. A type 2 island at time n stays a type 2 island at time n+l, and 
has a Bi(2A;A, ^)-distributed number of type 1 islands as offspring. Each new 
island starts at an independent and randomly chosen point of the circle, and at 
each subsequent step acquires k more vertices at either end. Writing 



for the numbers of islands of the two types at time n, where the superscript T 
denotes the transpose, their development over time is given by the branching 
recursion 



M(n) := iM^^\n),M^^\n)f, 



n > 



M(i)(0) 



Bi (n - 1) + 2fcM(2) (n - 1))A, ^) 

M(i)(n- 1) + M(2)(n- 1) : 
1, M(2)(o)=o. 



(4.2) 



The total number of intervals at time n is denoted by 



M+{n) = M^^\n) + M^'^\n), 



(4.3) 
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and the total number of vertices in these intervals by 

n-l 

s{n) = M+{n) + 2k^M+{j) > M+{n). (4.4) 

3=0 

As before, we use the branching process as the basic tool in our argument. 
It is now a two type Galton- Watson process with mean matrix 



A = 



a 2ka 
1 1 



The characteristic equation 

(t - l){t - a) = 2k(j (4.5) 

of A yields the eigenvalues 



A = Ai = i{cr + 1 + v/(o- + 1)2 + 4a(2fc - 1)} > cr + 1; 



-A < A2 = i{cr + 1 - y/{a + 1)2 + 4CT(2fc - 1)} < : 
also, from 14.5|l . 

A + A2=o- + l and AA2 = -o-(2fc - 1). (4.6) 

From the equation JA = A/, we find that the left eigenvectors f'^^\ i = 1, 2, 
satisfy 

/« = (A,-a)/«. (4.7) 

We standardize the positive left eigenvector /^^^ of A, associated with the eigen- 
value A, so that 

/l'^ = (A-<t)-^ ^ (A-a)^; (4.8) 



for /(2), we choose 



Then, for i = 1, 2, we have 

E((/W)^M„+i|J^(n)) = (/«)^AM(n) = A.(/«)^A^(n), 
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where J^{n) denotes the cr-algebra a{M{0), . . . , M{n)). Thus, from (|4.7|) . 

W^W(n) ~ A7"(/(*))^Af(n) (4.9) 
= Ar"/f'(M« + (A.-a)Af(2)) 

is a (non-zero mean) martingale, for i = 1,2; we let 

Wk,a ■■= lim VF(i)(n) a.s. = lim X^'\f^^^f M{n) a.s. (4.10) 

be the almost sure limit of the martingale W'-^^n). 

Our main conclusions can be summarized as follows: the detailed results 
and their proofs are given in Theorems 15.61 and 15.91 Let denote a random 
variable on the integers with distribution given by 

2 



P [Ad > a;] = Eexp < - 



(a^) (^-^)(2A-'T)'/'?A-W^fe,.W^^,J, (4.11) 



for any x G Z, and set D* = Ad + 2nd. Here, Ud and <j>d are such that A""* = 
</)d(A(T)^/^ and A~^ < (t>d < 1. Let D denote the graph distance between a 
randomly chosen pair of vertices on the ring lattice C. 

Theorem 4.1 If Aa oo and p — ka remains bounded, then it follows that 
dry ('C(I?), £(-D*)) ^0. If p 0, then pAd — *"d T, where T is as in Theo- 
rem \2.l\ 

Note that the expectation in (|4.11|) is taken under the initial condition l|4.2|) : 
we shall later need also to consider the distribution of Wk^a under other initial 
conditions. 

5 The discrete circle model: proofs 

We begin the detailed discussion with some moment formulae. 
Lemma 5.1 For the means, 

EM(i)(n) = ^^(A"(a-A2) + A^(A-a)); 
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A — A2 

EM+(n) = ^^(A"+i -A^'+i) <2A", (5.1) 



and 



E(MW+2fciV/(^)) = _2_{(i_A2/<7)A"+i + (A2/a)(A-a)A^} 

< cA", for c = Ak-l. (5.2) 

For i/ie variances, for j < n, 

Var(M/«(j)-W^^'H^)) <'^'(/^")'A-^ ^= (5.3) 

anrf, /or M'^ [n) , 

Yai M+{n) < A^"; (5.5) 
EAf+(n)(M+(n) - 1) < 4(1 + k2)A2". (5.6) 

Note that, from H4.5|l . we have 

o c 4fc - 1 , , 

Proof: First, observe that 

EW^W(n) = = (/«)^A^+ = (/«)^(1,0)^ = /f) (5.8) 

for aU n, by the martingale property. From (|4.9|l and (|4.7() . we have 

(/(i))-iA"W^(i)(n) = A#(i)(n) + (A-a)Af(2)(n); 
(/fVA2W^(''(n) ^ M'^^\n) + {X2~a)M^^\n), 

and thus 

M(i)(n) = A'W(i)(n)^^^ + A^W(2)(n)^^^; (5.9) 

(A-A2)/« (A-A2)/P) 



M(2)(^) = A'W(i)(n) i y^w^^^n) 



(A-A2)/f^ (A-A2)/(')' 
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From H5.9|l and l|5.8|l we obtain 

giving H5.1|l : for the last part use cr + 1 — A = A2 and ct + 1 — A2 = A, from 1)4. 6f) . 
Then (|5.2|) follows immediately, using 1)4. 5|l and ()4.t)|l . 
Now define 

:=M(^)(n)-CT(Af(^)(n-l) + 2fcM(2^(n-l)), ?i > 1, (5.10) 

noting that it has a centred binomial distribution conditional on J-{n — 1); 
representing quantities in terms of these martingale differences greatly simplifies 
the subsequent calculations. For instance, 

= Ar"-i/«{M(i)(n + 1) + (A, - cT)M^^\n + 1) 

- A,M(i)(n) - A,(A, - a)M(2)(„)} 
= A-"^Vf^{M(i)(n + 1) - aM^^Hn) - 2kaM'-^Hn)} 
= Ar«-V«X(n+l), (5.11) 

where we have used (A^ — ^)(\ ^ o") = ^ka, from 14. 511 . and the branching 
recursion. 
Since 

^{X\n + 1) I Tin)} = ^ (1 - ^) (Af(i)(n) + 2fcM(2)(n))A, 

we have 

F,X^{n + l) < caX", 
from 1)5. 2(1 . Thus, immediately, 

E{(VF«(n + 1) - W^W(n))2} < c(T(/f VV'^'^A". (5.12) 

Hence, for i = 1, 2 and for any < j < n, 

Var {W^'\j) - W^'\n)) 
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= E{W^'\k) - + 1))2 

k=j 

k=j 

< caif^'^rX-' (^)' niin |^^, [n ,)} max (^1, (^)"") , 

and the formulae H5.3|l and (|5.4|l foUow. 
Moreover, from H5.9|) . 

M+H = ^irW^^'^H - ^W^'Hn) , (5.13) 

and hence 

n 

(A - A2)'VarM+(n) = E(A"+i-^ - A2+'"^)'VarX(j) 

n 

< 40(7^ A^"- (J < 4ctTA2"+V(A- 1). 

From this, using the inequahty 

EM+(n)(M+(?i) - 1) < Var(M+(?i)) + (EM+(?^))^ 
H5.6|l is easily obtained. [] 

As in the previous section, we run two branching processes M and M' =: N 
independently, and investigate the time at which the first intersection occurs, 
irrespective of the types of the intervals. We write s'(n) —: u{n), and use 
notation of the form Mr to denote M{nd + r), for an appropriate Ud which we 
shall define later; we also use :— {2k{nd + r) + 1} to denote the length of 
the longest interval in the branching process at time Ud + r. Then, with Vry 
defined as before to be the number of pairs of intervals of M and N intersecting, 
when M has been run for time Ud + r and N for time + r', the analogue of 
Proposition 13.31 shows that 

\P[Vr,r' = 0] - P[D >2nd + r + r']\ 

< 32A-V2.^,,)E{iM+iV+(M+ + N+ - 2)}, (5.14) 
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and that of Corollary 13. 21 gives 

\P[Vr,r' = I M+ = M, N+ =N,Sr^ t, U,.- = u\ 

- exp{-A"\iVt + Mu - MiV)}| 

< 8A"i(Af + iV)T(rvr')- (5-15) 

The estimates ()5.14|l and ()5.15f) can be made more explicit with the help of the 
bounds 

EA#+ < 2A"''+''; EAf+(Af+ - 1) < 4(1 + k2)A^("'^+''\ (5.16) 

which follow from from Lemma 15.11 together, they give the following result, 
in which D denotes the shortest distance between Pq and a randomly chosen 
vertex P' of C. 

Lemma 5.2 With the above notation and definitions, we have 
\P[D>2nd + r + r']-P[Vr,r' =0]\ 

and 

\P[Vr,r' = I M+ ^ M, N+ = N, Sr = t, Ur' = u] 

- exp{~A-^{Nt + Mu - MN)}\ 

< 32A-V(^vr')A""+*'''"''^. 

We now need to examine Eexp{— A^^(A/^^Sr + A/+{tr' — Af+A/^)} more closely. 
To start with, from H5.1|l in Lemma l5. II we have 

E§{n) = ^2_|(A"+i_Ari) + 2fcXi(A^+'-Ar')| 
1 ^"+1 (^1 + ^ V A?+i fl 



A - A2 1 V A - 1 y ^ V A,. - 1 

A A2 



2k 



A - 1 A9 - 1 



-^^{A-^-Ar^-(A-A.)}, (5.17) 
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where we have used H4.5II and (|4.t)|l to simpUfy, and this expression is rather 
close to (A/(T)EAf as given in H5.1|) . This reflects the fact that both s{n) 
and (A/cr)M+(n) are rather close to 

A"+2 1) 
<J{X-X2) Jp ■ 

Lemma 5.3 We have the following approximations: 
w/iere 

E|?7i(n)| < {3 + 2Atv/^rT^}max|A-i/2^^| ; (5.18) 

E|C/2(n)| < {1 + 2/tVn + A- l}niax|A-i/2^-^| . (5.19) 

Proof: We first express s{n) and (A/f7)M+(n) in terms of the martingale dif- 
ferences {X{1), I > 1}. From (|5.13|l and 15.11|l . we have 

\n+l 

^ -M/«(n-l)+X(n) ' 



(A - A2)/f ^ A - A2 

Similarly, from (|5.13(l and l|5.11|l . 



' ri-l \ 

1 + J2X^'X{1)\ 



j=0 

A - A2 ^ 1 f(i) " f(2) 

^ 7 = I Jl Jl 



3- 

n-1 



A - A9 ^ I 

=0 I i\ 



A^M 1 + E^2"'^W 
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— [-^ — - — [P '") 

n— 1 l — l \ n— 1 n 

1=1 j=Q ^ 1 = 1 j=l+l 



n-1 



\n+l / n-1 
^2 



A,- 



- 1 + 5]A2-'X(0 



Substituting these into (j4.4(l . and because 1 + 2k/{Xi — 1) = Ai/fi, i — 1,2, 
from H4.5|l and H4.6|l . we obtain 



\n+2-i\ \n+2 
-^2 \ , ^ , ^2 



A — Ao / A — Ao 



where 

n+l 



|E[/i(n)| = A-"-2|^_^2^^«+2| <3i^ax|A-\^ 



and 



VarC/i(n) < A-2"-4ca(A - Aa)^ << a^A"-! + ^ A'-M 1 ' 



n+2-i \ ^ 



< 4K2(cr^ +n)max|A"\^| 



2 ^ "+i 



giving the first approximation. By similar arguments, for (A/(T)M+(n) we ob- 
tain 

-M+{n) = — — \^ ^ 



(T \ A — A2 A — A2 



a{X - A2) \^ /( 



AT) + U2in) , 
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where 

|EC/2(n)| = (|A2|/Ar+i 

and 



Var{/2(n) < X'^''~^ca{X - X2)U X''-^ + J2 



2 



< 4k^(A- l + n)max<j A"\^ , 



giving the second approximation. 



We now use these approximations as in the previous section, starting by 
observing that 



s{n, n') 



Eexp{-A"i(iV+(n')sN - Af+(n)iV+(n'))} 

Eexp|-(^^-^^ (A-(T)(2A-a) f^^J W^^\n - l)W"^^\n' - I) 



\3 \n+n' r ^ ^ 

- a{X-X,y ~ir {m[in')\m'"M+{n)} + E\U2{n)\ 

+ E|{7i(n)|E{A-"'7V+(n')} + E|;7^(n')|} 

A V A"+"' 



{E|;72(n)|E{A-"'7V+(n')} + E|C/^(n')|} 



A - A2 y A 

since EW^(i)(m) = f[^'^ = (A - cr)"!/^ for aU m. Since also, from 
A~'"EM+(to) < 2, it follows from Lemma 15.31 and because A > 1 + cr > 1 
that 

A3 A"+"' 



e 



< {17 + 18K^/{n V n') + ((t2 V (A - 1))} 



(A-A2)2 Aa 

I \ n ("^"') 

A-i/2,^l . (5.20) 



Then similarly, since (W^^'^{j) — Wk,a)/ fi''^ has mean zero and, letting n ^ 00 
in 1)5.3(1 . variance at most k^A"^ , it follows that 

Eexp|-(^^^^ {X-a){2X-a) (^^^ W^'\n - l)W'^'\n' - 1)^ 
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< 2 (xA^) ' (A - o){2\ - a) [^^^ «;A-«"^"')-i>/2_ (5 21) 

So choose Tici so that A"'' = ^^(Acr)^/^ with A^^ < (/id < li and let n = rid + r, 
n' = + r' ; then define the quantities 

V'i{r,r') := 2560;^M2fc(nd + (r V /)) + 1)}'(1 + k')A'-+'-'+('-^'''); 
T,^(r,r') := 32M^(2fc(nd + (r V r')) + 1)}A('^^'-'); 

and 

77^ (r,r') := 18,^^-^Ai-^{l + K{nd + (r V r') + (tr^ V (A - i)))i/2}^r+r'-7(rAr')^ 
where 

7 := 7(fc,fT) :=min{i,(log(A/|A2|)/logA}. (5.22) 

Note that, for fixed ka — p, simple differentiation shows that Ai is an increasing 
function of a and IA2I a decreasing function, so that Ai(cr) > Ai(0), |A2((t)| < 
|A2(0)|, and hence 



log(A/|A2|) _ ^ loglAzI ^ ^ logiVTTS^- 1) ^ 1 
log A log A - log(Vl + 8p+l) ~ 2 

in p < 1. Thus, for p < 1, we have 7 = ^• 

Then, from Lemma lS^ and H5.20|l and H5.21|l . we have the following analogue 
of Theorem 

Theorem 5.4 With the above assumptions and definitions, for a; € Z and r = 
r{x) — [a;/2j , r' — r'{x) — x ~ r{x) < {x + l)/2, we have 

2 



P[D > 2?i<j + x] - E exp { - 



(a^) (A-^)(2A-^)'/'?A-W^fc,,W^,. 
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{X-a)i2X-a)<t>lX''Wk,.Wi 



In particular, if p < 1 , 

P[D > 2nd + x] - Eexp |- (^^^^ 

< Wi{r,r')+v'2{r,r'mA<j)-^/' + {v',{r,r') + r^i{r,r')){Aar^/\ 

Note that the expectation in Theorem 15.41 is taken conditional on the initial 
condition Mq = e*^^^ 

The theorem can be translated into a uniform bound, similar to that of 
Theorem El To do so, we need to be able to control E{e"'''^'='<''^^.- } for 
large ip. The following analogue of Lemma \'A . 71 makes this possible. To state it, 
wc first need some notation. 

For Wk,a as in (|4.1U|) . let (j)k^cr ■— {(l)i,4>2) denote the Laplace transforms 

Me) = E{e-''(-^i")"''^'=- |Mo-eW}; (5.23) 
MO) - E{e-^(/i'')"'^'=- |Mo = e(2)} 

of Ci{f[^'')^^Wk.a-)i where e^'^ is the i'th unit vector. Although we now need 
to distinguish other initial conditions for the branching process, unconditional 
expectations will always in what follows presuppose the initial condition Mq = 
e^^\ as before. Then, as in Harris ^Uj, p. 45, (f>k.cr satisfies the Poincare equation 

M>^0) = 9\Md),M&)) in m>0; i ^ 1,2, (5.24) 
where is the generating function of AIi if AIq ~ e*^*): 

oo 

g\sx,S2)^ ^ p^(ri,r2)s^is2% 

where 'p'^(r\, r2) is the probability that an individual of type i has r\ children of 
type 1 and r2 children of type 2. Here, from the binomial structure, 

and 

g^[si,s2) = *2 (^X*^ ^ ^ A J 
The Laplace transforms 0fe.cr can be bounded as follows. 



34 



Lemma 5.5 For 6,a > 0, we have 

1 



1 + 6'' 

A^a-M^) =■■ MS) < ^ 



i + e{x-a)' 

and hence 

E|g-K/f') ''^'=-<.<' |M(0) = M'(0) =e(i)| < 9-Hog{l + e). 
Proof: We proceed by induction. Put 

<Pi,n{e) = E(e-^(A''')"'^'''(")|M(0)=eW), i = 1,2. 



Then 



Assume that 



1 + 61' 



1 + 6'(A - a) 
By the Poincare recursion, 



0i,n+l(6') = 5' (^01," (^^j ,«i'2,n (^^ 
for « = 1,2. Hence, using the induction assumption, 

^ A A + ^ 



A(l + 61) - 6'a A + 61 + 6»<T 

A(A + 9) 

A(l + 6')(A + 0) + 6I2(A -l-a)a 
1 



< 

- l + 6»' 
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and, also from (|4.5|) . 



< 



A + 0(A-a) V^ + ^ 

A x + e 



x + e + e{x-c7-i) x + e + 2k(je 
X x + t 



< 



x + e + e{x-cT-i) x + e + {x- i)(a - a)9 

A(A + g) 

A(A + e){l + 0{X - a)) + 02(A - 1 - (7)(A - cr)(A - 1) 
1 



l + 0{X-<7)' 

Taking limits as n — > cx) proves the first two assertions. The last assertion 
follows as in Lemma [3.71 [] 



P[Arf>x]=Ecxp<'-(-^) {X-a){2X-a)<j>lX-Wk,.Wi,}, x € Z, 



Theorem 5.6 Let denote a random variable on the integers with distribu- 
tion given by 

V ^ 

.A-A2, ^ 

(5.25) 

and let D* = A^; + 2nii. Then 

dTv{C{D),£{D*)) = O (log(Aa)(Aa)-^/(4-^)) , 

uniformly in A, k and a such that ka < pQ, for any fixed < po < oo, where 7 is 
given as in \5.2^) . Hence dTvi^{D), C{D*)) if Aa — > 00 and kcr remains 
bounded. In particular, 7 = 1/2 i/ fccr < 1, and the approximation error is then 
of order 0{log{Aa){Aay^/''). 



Glog(Aa-)-21og0 



; set r{x) 



Proof: Fix G, and consider x satisfying x < j^^^ 
[a;/2j, r'{x) — x — r{x) < (a; + l)/2. Then it follows from Theorem 15.41 and 
(EZj) that 

|P[L» > 2nd + x]-P[D* > 2nd + x]\ 

< {v'Mx),r'{x))+rj',{r{x),r'{xmAa)-'/^ 
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+77^ (r (x) , r' (x)) (Aa)-i/4 + ^^(^(^) , / (x)) (Aa)-^/^ 



O 



/ fcalog(Aa) \ (i_3G)/2 

+ A3/4(A~a)2(Aa)-(i-3«)/4 



+ Ai-^/2 



1 /2 

/ l0g(Aa) ^ ,2 ^ _ A (^_G(2-^))/2 

V logA V 



Also, from Lemma recalling that (/i ) ^ = A — a, we have the upper tail 
estimate 



P(Arf > x) 



0(log(AtT)(ACT) 



Comparing the exponents of Act, and remembering that 7 < 1/2, the best 
choice of G is G = 7/(4 — 7), making G = (7 — G(2 — 7))/2; noting also that 
A = 0(1 + CT + \/A?ct), the theorem is proved. [] 

Remembering that the choices ka — p and A = Lk match this model with 
that of Section 12 we see that Act = Lp, and that thus Theorem 15.61 matches 
Theorem 13.91 closelv for p < 1, but that the total variation distance estimate 
here becomes bigger as p increases. Indeed, ii p ^ 00 and ct — 0{k), then 
7(fc, ct) — + 0, and no useful approximation is obtained. This reflects the fact that, 
when IA2I/A is close to 1, the martingale W^^''{n) only slowly comes to dominate 
the behaviour of the two type branching process; for example, from (|5.13|l . 

then retains a sizeable contribution from 

iy(2)(„) until n becomes extremely 
large. This is in turn a consequence of taking the shortcuts to have length 1, 
rather than 0; as a result, the big multiplication, by a factor of 2p, occurs 
only at the second time step, inducing substantial fluctuations of period 2 in 
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the branching process, which die away only slowly when p is large. However, 
ii p —^ oo and k = 0{a^^^) for any e > 0, then liminf 7(fc, cr) > 0, and it 
becomes possible for C{D) and C{D*) to be asymptotically close in total vari- 
ation. This can be deduced from the proof of the theorem by taking k ^ L°' 
and a ^ for choices of a and (3 which ensure that dominates p. Un- 

der such circumstances, the effect of two successive multiplications by a in the 
branching process dominates that of a single multiplication by 2p at the second 
step, and approximately geometric growth at rate A ~ cr results. However, as 
in all situations in which p is a positive power of A, intcrpoint distances are 
asymptotically bounded, and take one or at most two values with very high 
probability; an analogue of CoroUarv 13.101 could for instance also be proved. 

li p = ka is small, we can again compare the distribution of Wk.a with the 
NE(1) distribution of the limiting variable W in the Yule process (see 0), using 
the fact that its Laplace transforms satisfy the Poincare equation H5.24|l . Define 
the operator S by 



(S0)2(0) 9 



a; '"^ 

e 

A 



2 T -r<Pi T 



A / V A"' V Ay ■ A 



2kK 



Let 



{7 = (71,72) : [0,00)2 ^ R : 1,^1,, sup max { M^MZ^M | < 00 
and 

n := jv' = (?Ai, V'2) : [0, oo)2 ^ R : "^'^^^^ g2^^ ~ bounded, 
MO)-il - OiX^^)) ,3 bounded! . 
Then H contains (f>k,a — {4'i,4>2) as defined in (|5.23|l . since 
E{{f['Y'Wk,a I M(0) - e(i)} = 1; I]{if['Y'Wk,a \ M(0) = e^^)} = A - a. 
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and taking limits in H5.3|l shows that Va,r Wk.a exists. We next show that S is 
a contraction on Ti. 

Lemma 5.7 The operator 'E. is a contraction on Ti., and, for all ijj,x (^"H, 

'2ka+ r 



\^->p-^x\\g < 



A2 



Remark. Note that 



2/ca + l A2 - (A- l)(cr + 1) 



< 1. 



A2 A2 
Proof: For all tpjX ^ 'H and 9 > 0, observe that t/j — x ^ G- We then compute 

mU9)-iEx)i{e)\ < 



so that 



V'2 T ^ X2 T 



+0- 



^1 T - XI T 



mU9)-{Exh{0)\ < 1 k2(f)-X2(f)| 



02 



A2 



(f) 

-l^i(f)-xi(f)| 



< 



A2 

(J + l 

A2 



xWg- 



Similarly, 



\{E^U9)~{Ex)2{9)\ < 



V'2 T - X2 T 



+2ka 



■01 T ^ Xi T 



and 



\m,{9) - {Ex)2{9)\ 

02 



< 



2ka+ 1 
A2 



IIV'- xll 



Taking the maximum of the bounds finishes the proof. 



(5.26) 



Thus, for any starting function t/j — {■ipi,'ip2) G Ti. and for (j)k,a 
given in (|5.23|) . we have 

Hk.a-ipWg < - Sf/'llg + IIS?/; - vile 

2k(7 + l 



< 



A2 



Hk,a - -0115 + IISV'- VII 
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so that 



A2 



(5.27) 



A2 - (2/ccr + 1) 

Hence a function t/i such that — is smaU provides a good approximation 

to 4>k,a- 

As a candidate ■0, we try 

1 



Aim = Y 



1 



i + e{\-ay 

Lemma l5 . 51 shows that this pair dominates (pk.a- 

Lemma 5.8 For ip given in f5.28\) . we have 

2ka{X^ - Act - 1 + fccr) 



Proof: For > 0, we have 
A 



A2 



1 - 



a9 



AiX + 0) 



1 



x{i + e) 



1 + 61 iA + 6'(A-CT) V (A + 6*) 



1 - 



a9 



-1>+Ri 



where 



Moreover, 



Ri 



X 



X + 9{X-cr) 



1 - 



A{X + 0) 



- 1 



a0 
A+1 



X{l + 9) 



(70 



1 + 61 tA + 6i(A-CT) V A + 

e^ajl - A) 

~ (1 + 0)(A + (A-ct)0)(A + i 

From Taylor's expansion, it follows that 

AA(A- 1)ct26'2 



1^1 1 < 



2(A+(A-ct)0) A2(A 



< 



2A2 



(5.28) 
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Hence 



612 - 2A2 

Similarly, 

{~i;)2{e) - MO) 

A / aO 1 



1 



\ + e{\-a)\ A{X + e)J l + 9{X-<7) 

2kcje\ 



1 f A(l + g(A-a)) / 

l + 9{\-a)\ X + e{\^<j) \ 



x + ej 



2, 



where 



A{X + e)J A + i 



A + 6i(A - a) 
Using (|4.5|l . we obtain 

1 [ A(l + g(A-g)) / _ 2ka9\ _ 

g2(A- l)(A-g)(l + Ag- A2) 
{l + e{X~a)){X + (X~a)e){X + 0) 

2fccre'2(l + Acr- A2) 
(1 + e{X - a)){X + (A - a)0){X + 9) ' 

From Taylor's expansion, it now follows that 

2fcA(2fcA- l)Acr26l2 



1^2 1 < 



< 



2{X+(X-<j)0) A2(A + 0)2 

2fc2(T202 



A2 
Hence 



(S^)2 (0) - V2 (0) I ^ 2fca(A2 - Aa - 1 + fca) 



02 - ^2 ' 

completing the proof, since 

2(A2 - Xa-l + ka) = 2(A - 1) + 2(3fc - l)cr > 2(A - 1) + a. 



This enables us to prove the exponential approximation to C{Wk.a) when ka 
is small. 
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Theorem 5.9 As ka 0, £{Wk^a) ^ NE(1). 

Proof: Let 0^. g- be as in (|5.23|) . and ip as in H5.28|) . Then {(pk.cr)i is the Laplace 
transform of 



and Vi that of NE(1), and (-^ - cr)^''^ ^ 1 as fccr -> 0. Hence it i 



IS 



enough to show that 



hm ||(/)fc.cr - ip\\g = 0. 



However, using Lemma [5 .81 and l|5.27(l . we obtain 

\\<Pk.a - V'b 

<2fc.^!-^^^l±^ 
A2 - 1 - 2/ccr 

2ka[<7 + 1) 

< fccr(l + 2fccr)(5 + 2o-) ^0, (5.30) 
since /ccr — » 0. This proves the theorem. Q 

Again we can use this resuh to derive an approximation to the distribution 
of the distance for D, based on a corresponding distribution derived from the 
NE(1) distribution. The starting point is the foUowing resuh. 

Theorem 5.10 Let W, W be independent NE( 1 ) variables. Then, for all 9 > 0, 

Eexp {^e{X - a)Wk,.Wi,} - Ee'^'^^' 

< 56'2A;cr{(3cr/2) + (l + 2fccr)(5 + 2(T)}. 

Proof: As in the proof of Theorem 13.141 with (f>k,a as in (|5.23() and V as 



m 



and because, from (|4.8|l . X — a = (/f^'') ^, we have 



Eexp {-0(A - - Ee 
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Now 



E{{cf>k,a)i(0{f['Y'Wi,)}-EMOW) 
+E {^1 V^M^L)} - ^MOW). 

in the proof of Lemma l5 . 71 gives 

{(S0)i(^?(/(^Vi<j}-e{(S^.,.)i(0(/1^Vi<j} 



E 



< 
< 



A2 



from lUTni, and lf?T|l . and 

can be bounded by 



then implies that the above expression 



3^ ^ 



fccr(l + 2fc(T)(5 + 2cr). 



Similarly, from (|5.29|) in the proof of Lemma 15.81 



< w 



,2 a{2(A-l)+a} 



2A2 



Then, with W ^ NE (1) independent of W'^, ^, we have 

and hence, from (|5.30() in the proof of Theorem 15.91 it follows that 

E[>PmfiY^Wia)}-^i'i{OW)\ = |E{(0fe,,)i(0W^)}-E^i(0M^)| 

< 26*2/0(7(1 + 2fccr) (5 + 2cr). 

Since A — 1 < 2k(j and A^ > cr + 1, the assertion follows from the triangle 
inequality. [] 



Recalling that 



Ee^ 



-eww 



lo 1 + % 

we can now derive the analogue of Theorem 13. 151 



dy, 
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Theorem 5.11 As in Theorem 15.61 let Ad denote a random variable on the 
integers with distribution given by 

2 

A - X2, 



Let T denote a random variable on R with distribution given by 

/■°° e-y 

/o 1 + 2ye^ 



Then 

sup|P[^Ad>z] -P[T>z]| = o|(fca)i/3(i + log(l/fca))|, 
uniformly in ka <2. 

Proof: We use an argument similar to the proof of Theorem 13. 151 Putting 

~f,, log A _ log(l + 2A^) 
A-1 2^ ' 

we have, as before, 

1>5(A)>1-^; 

we also write 

2 



We use the following bounds. First, from the characteristic equation 14. 5() . 
we have 

2fccr 

< A - 1 = < 2ka. 

X — a 

Then, since y/a + b < ^/a + \/b for a,b> 0, it follows that 



l + cr<A < l + a+ ^/a{2k- 1). 
Thus, and because X~^ < (f>d ^ ^ and A2 < 0, we have 



/3(A) < 1 + (a/2) + y/a{2k-l) 
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and 

/3(A) > 



1 



= 1 - 



X-X2, 

2a{Ak - 1) + cr^ 



(cr + 1)2 + 4cr(2fc - 1) ■ 
This in turn gives 

i«A)-ii < ma. {(,/2) + vTw^), } 

= : r(cr,fc) 

= O ^maxjcr, v^^}^ . 

For the main part of the distribution, we write 

P [^Arf >z]- P[T > z] 

= P [^Ad >-A- j (1 + 2y/3(A)e2-^(^) j dy (5.31) 

+ / (1 + 2y/3(A)e2^'^(^) j - P[r > z5(A)] (5.32) 

+ P[T>z6(A)]-P[T>z]. (5.33) 

Now, for (|OT|) . Theorem Eini yields 

P [^Arf > 2] - y (1 + 22//3(A)e2-'^(^) j 

= Eexp{-2/3(A)e2-(^)iyfe,,W^^,,}-y"e-2'(l + 2y/3(A)e2^^(^))"' dy 

< A(3{Xfe^"^^^'^5k(j{iSa/2) + (1 + 2fca)(5 + 2a)} 

< (2A - a)^e^^5ka {(3a/2) + (1 + 2fccr)(5 + 2a)} . 

With l|TTH|l . we have, for that 

(1 + 2y/3(A)e^"'^(^)) - P[r > zS(A)] 

< o,2.a(A) l/?(A)-l| 
max{l, 2/3(A)e2^2(^), 2e2^2(^)} 

2zc(A) 

maxjl, 2e^^'=*^'^-'| 

< r(cr,fc). 
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Similarly, for H5.33|l . because 1 — c(A) < < ka and from Taylor's expansion, 
it follows that 

|P[r> zS(A)] -P[r> z]| 

' / \ — 1 /"OO 

(1 + 2ye^^'^(^) j dy - j e"^ (l + 2ye^^) dy 





/•OO 




Jo 





1| 



U z > 0, this gives 

|P[r > zc{X)] - P[T > z]| < 2z(l - 5(A)) < 2kaz; 
if z < 0, we have 

\F[T > zd{X)] - P[T > z]\ < 2|z|(l-5(A))e-2^^(^) < 2/cfT|z|e2"(i-'^'"). 
Hence we conclude that, uniformly in ka < 1/2, 
P [^Ad >z]- P[T > z] 

< 5fccre*^(2A - erf {(3cr/2) + (1 + 2fccr)(5 + 2a)} 

+ r{a, k) + 2ka\z\ min{l, e^^'^^-^"'^} 

< Ci {fc(T(e-^^ + 1) + Vk^} , (5.34) 

for some constant Ci. 

For the large values of z, where the bound given in H5.34|l becomes useless, 
we can estimate the upper tails of the random variables separately. First, for 
X e Z, we have 

P [Arf >x]= Eexp [~2f3{X)X-{f['^)-^Wk^^Wi,} , 

so that, by Lemma [5. 51 it follows that 

P [^A, > z] 

= Eexp[-2P{X)e'^-'^^\fi'Y^Wk,.W^ 

< ' (2A - a)- V.-^e--^(^) log (^1 + (3^) ' (2A - a)<^^e-^(^)' 

< 4Ae-^^'^(^)log(l + (2A-a)e^"'^(-^') 

< 4Ae-2^(i-M log (1 + (2 + a + 2^a{2k - 1)) e'") , z e ^Z. 
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For the upper tail of T, as in and with z > 0, we have 

P[T>zc{X)] < e-2^'^(^)(i + zg(A)) < (l + 2)e-2-(i-'="). 

Combining these two tail estimates, we find that, for z > 0, 

|P[T>z5(A)] -P[r> z]| < C2(l + z)e-2^(i-'^'=\ (5.35) 

uniformly in ka < 1/2, for some constant C2. Applying the bound H5.34|l when 
z < (6 — 2k(j)^^ log{l/ka) and (|5.35|l for all larger z, and remembering that T 
has bounded density, so that the discrete nature of requires only a small 
enough correction, a bound of the required order follows. [] 

Acknowledgement. We would like to thank Svante Janson for very helpful 
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